Testing the Fault-Tolerance of Networked Systems
Authors
Abstract
This paper presents an extensible framework for testing the behavior of networked machines running the Linux operating system in the presence of faults. The framework allows injection of a variety of faults, such as faults in the computing core or peripheral devices of a machine, or faults in the network connecting the machines. The system under test, as well as the fault load and workload run on it, are configurable. The framework is supported by a graphical user interface for experiment control. We have tested the framework with a set of different fault injection experiments. The framework has proven to be stable and to work as expected.
Similar resources
Modeling and Analysis of Distributed Reconfigurable Hardware
The ability to migrate hardware processes in a network of hardware reconfigurable nodes improves the fault tolerance of these networks. The degree of fault tolerance is inherent to such networked systems and can be optimized during design time. Therefore, an efficient way of calculating the degree of fault tolerance is needed. This paper presents an approach based on satisfiability testing whic...
An approach to fault detection and correction in design of systems using of Turbo codes
We present an approach to the design of fault-tolerant computing systems. In this paper, a technique is employed that enables the combination of several codes in order to obtain flexibility in the design of error-correcting codes. Code-combining techniques are very effective, and turbo codes are one example of such codes. The Algorithm-based fault tolerance techniques that to detect errors rely on the c...
Advanced design scheme for fault tolerant distributed networked control systems
This paper addresses the integrated design of fault tolerant distributed networked control systems (NCS). The NCS under consideration consists of two levels. At the lower level, sensors, actuators and local controllers are embedded and networked by sub-nets. They are coordinated and supervised by the control stations located at the higher level. The core of the design scheme is the integrated design...
The Study for Guaranteed Cost Fault Tolerant Control of the Networked Control Systems
In this paper, the problem of guaranteed cost fault-tolerant control for networked control systems (NCSs) is discussed based on Lyapunov stability theory and Linear Matrix Inequality (LMI). The sufficient conditions possessing robust integrity against actuator failures are given by adopting memory state feedback control law, which can meet a cost function for closed-loop networked control syste...
Performance Evaluation of Fault Tolerance for Parallel Applications in Networked Environments
This paper presents the performance evaluation of a software fault manager for distributed applications. Dubbed STAR, it uses the natural redundancy existing in networks of workstations to offer a high level of fault tolerance. Fault management is transparent to the supported parallel applications. STAR is application independent, highly configurable and easily portable to UNIX-like operating s...
Publication date: 2002